Abstract

An XOR function is a function of the form $g(x,y) = f(x \oplus y)$, for some boolean
function $f$ on $n$ bits. We study the quantum and classical communication complexity
of XOR functions. In the case of exact protocols, we completely characterise one-way
communication complexity for all $f$. We also show that, when $f$ is monotone, $g$'s
quantum and classical complexities are quadratically related, and that when $f$ is a
linear threshold function, $g$'s quantum complexity is $\Theta(n)$. More generally, we make a
structural conjecture about the Fourier spectra of boolean functions which, if true, would
imply that the quantum and classical exact communication complexities of all XOR
functions are asymptotically equivalent. We give two randomised classical protocols for
general XOR functions which are efficient for certain functions, and a third protocol for
linear threshold functions with high margin. These protocols operate in the symmetric
message passing model with shared randomness.



1 Introduction 

The communication complexity model was introduced by Yao in 1979 [28]. In its most
basic form, the model considers two separated parties (Alice and Bob), who attempt to
compute some function $f(x, y)$ of their joint inputs $x$, $y$, while using the minimum amount
of communication. They may be required to compute $f$ exactly (the deterministic model),
or may be allowed some constant probability of error (the bounded-error model). The
considerable theoretical interest of this simple model, as well as its practical applications,
has motivated its intensive study over the last thirty years.

More recently, the model of quantum communication complexity was introduced [29, 12].
In this model, Alice and Bob are allowed to send and receive qubits (quantum bits), with
the goal being to reduce the amount of communication required. It has recently been
shown that, when the function $f(x, y)$ to be computed is partial (there is some promise on
the inputs $x$, $y$), there can be an exponential separation between quantum and classical
communication complexity [19, 5]. No separation beyond quadratic is known for total
functions, and it is conjectured that the quantum and classical communication complexities
of total functions are in fact polynomially related. However, this conjecture has resisted
proof in both the exact and bounded-error models.

A natural way to make progress on the conjecture is to attempt to prove it for restricted
types of function. The class of functions $g(x,y) = f(x \wedge y)$, where $f$ is a boolean function,
has received particular attention. These functions seem to have first been considered by
Buhrman and de Wolf [2], who showed that deterministic quantum and classical
communication complexities are asymptotically equivalent for all symmetric functions $f$ ($f$ is said
to be symmetric if $f(z)$ depends only on $|z|$, the Hamming weight of $z$). They also showed
that these communication complexity measures are polynomially related if $f$ is a monotone
function ($f$ is said to be monotone if $f(x \vee y) \ge \max\{f(x), f(y)\}$ for all $x$, $y$). It was several
more years before Razborov proved, in a fundamental paper [20], that the bounded-error
quantum and classical communication complexities of symmetric functions in this model are
polynomially related; see [22] for a recent alternative proof. In other recent work, Sherstov
has shown that the conjecture does indeed hold if one is required to compute both $f(x \vee y)$
and $f(x \wedge y)$ [2^, and Shi and Zhu have proven lower bounds in a model with a more general
notion of composition of functions [25].

This paper is concerned with another natural class of functions, where Alice and Bob
each receive an $n$-bit string, and the function they need to compute is defined as
$g(x, y) = f(x \oplus y)$ for some boolean function $f$. These functions were recently studied by Shi and
Zhang [24], who refer to them as "XOR functions". Shi and Zhang essentially determined
the quantum and classical communication complexity of all XOR functions where $f$ is
symmetric, up to polylogarithmic factors^1. In particular, using Fourier-analytic techniques,
they showed that the exact quantum communication complexity of all symmetric XOR
functions (excluding a few trivial special cases) is $\Omega(n)$. Bounded-error communication
complexity is dealt with via a reduction to the previous result of Razborov [20]. The special
case where $f$ is a threshold function ($f(z) = 1 \Leftrightarrow |z| \ge t$ for some $t$) had been considered
previously by Huang et al [8].

In the present work, we consider more general classes of XOR function, for which we
find new quantum lower bounds and classical upper bounds on communication complexity.
As well as monotone functions, another class of function in which we will be interested is
linear threshold functions. $f : \{0,1\}^n \to \{0,1\}$ is said to be a linear threshold function
(LTF) if

$$f(x) = \begin{cases} 0 & \text{if } \sum_{i=1}^n w_i x_i < \theta \\ 1 & \text{if } \sum_{i=1}^n w_i x_i > \theta \end{cases} \qquad (1)$$

where $w_i, \theta \in \mathbb{R}$. The set $\{w_i\}$ are known as the weights of $f$, and $\theta$ is called the threshold of
$f$. These functions have been much studied in the computer science literature and elsewhere;
see El] for a review.



^1 Some general quantum lower bounds, which are tight for some XOR functions, had previously been
obtained by Buhrman and de Wolf [2], and also Klauck
In the case of the model of communication complexity studied here, LTFs are a
particularly natural class to consider, for the following reason. Imagine that Alice and Bob each
have a document, and they wish to determine whether their documents differ significantly.
In practice, differing at one position may be more significant than differing at another
(consider a bioinformatics application where mutations are more likely at particular points on
a chromosome). This can be modelled by the task of determining whether a weighted sum
of differences between bits held by Alice and bits held by Bob is above a threshold, which
is exactly the problem of computing an XOR function defined by an LTF.

The main results we obtain are as follows. First, we completely characterise the
deterministic quantum and classical one-way communication complexity of XOR functions
$g(x,y) = f(x \oplus y)$ in terms of an algebraic property of $f$, its Fourier dimension [6]. We
observe a relationship between deterministic two-way communication complexity and the
parity decision tree model introduced in the context of computational learning theory by
Kushilevitz and Mansour [13], and make a structural conjecture about the Fourier spectra of
boolean functions which, if true, would imply that the quantum and classical deterministic
two-way communication complexity of all XOR functions are asymptotically equivalent.

Turning to probabilistic communication complexity, we first show that one-way
protocols cannot be efficient for all XOR functions: indeed, one-way quantum communication
complexity can be exponentially larger than two-way classical communication complexity.
On the other hand, there are randomised classical protocols in the more restrictive
simultaneous message passing (SMP) model with shared randomness^2, which are efficient for
particular XOR functions. Using a previous result of Grolmusz [7], one can give an efficient
protocol for those functions $g(x, y) = f(x \oplus y)$ where $f$ has a very low spectral norm ($f$'s
Fourier spectrum is "narrow"). We give a new protocol that is efficient for functions where
$f$ is close to a parity function ($f$'s Fourier spectrum is "tall"), and in particular for functions
where $f$ takes the value 1 on a small number of inputs.

Specialising to particular types of XOR function, we first show that the deterministic
quantum and classical two-way communication complexities of all monotone XOR functions
are quadratically related. Specialising further, we show that the deterministic two-way
communication complexity of all XOR functions where $f$ is an LTF depending on $n$ bits is
$\Theta(n)$. Finally, we give a randomised communication protocol for computing LTFs in the
SMP model with shared randomness, which is efficient provided that the margin of the LTF
in question is high. The protocol generalises previous results [30, 8] on computing threshold
functions.

These results are all given more formally in Section 1.2 below. In order to state them,
we will first require some definitions.

^2 See Section 1.1 for the definition of this and other terms in this introduction.

1.1 Preliminaries



1.1.1 Query complexity and boolean functions 

We will use a number of mostly standard notions from the field of query complexity (see
the review [3] for further details). Let $f : \{0,1\}^n \to \{0,1\}$ be a function of $n$ bits. The
deterministic decision tree complexity of $f$, written $D(f)$, is the minimal number of queries
to the input variables $(x_1, \dots, x_n)$ that are necessary to evaluate $f(x)$ with certainty, for any
input $x$. A somewhat less familiar complexity measure is the parity decision tree complexity
$D^\oplus(f)$. This is the minimum number of queries necessary to compute $f(x)$ with certainty on
any input $x$, where each query the algorithm makes is the parity of any subset of the $n$ bits
of $f$'s input. Note that $D^\oplus(f)$ can be considerably smaller than $D(f)$; a trivial example is
given by taking $f$ to be the parity function on $n$ bits, where $D(f) = n$, but $D^\oplus(f) = 1$. This
model was previously studied by Kushilevitz and Mansour [13], who showed that functions
with low parity decision tree complexity can be learnt efficiently from membership queries.

A boolean function is a function on the boolean cube $\{0,1\}^n$ that takes one of at most
two values on all inputs. When studying the query or communication complexity of boolean
functions, we are free to relabel these values, as of course this choice makes no difference
to the complexity. In particular, we say that both $f : \{0,1\}^n \to \{0,1\}$ and
$f' : \{0,1\}^n \to \{1,-1\}$ are boolean functions. Any boolean function $f : \{0,1\}^n \to \{0,1\}$ can be written
uniquely as a multilinear polynomial in $n$ variables over $\mathbb{F}_2$; $\deg_2(f)$ denotes the degree of
this polynomial. The sensitivity of a boolean function $f$, written $s(f)$, is defined as the
maximum, over all bit strings $x$, of the number of neighbours $y$ of $x$ such that $f(y) \ne f(x)$.

1.1.2 Communication complexity 

We study several standard models of communication complexity (see the book [15] for
further details). In all models, two parties (Alice and Bob) each receive $n$-bit strings $x$,
$y$ (respectively), and share a string of public random bits. Their goal is to compute some
boolean function $g(x, y)$ using the minimum amount of communication. The matrix
$M_{xy} = g(x, y)$ is known as the communication matrix of $g$.

The communication complexity of $g$ in a given model is the total number of bits that
are required to be transmitted to compute $g$. In the two-way model of communication
complexity, Alice and Bob take it in turns to send bits to each other; we will assume that
Alice speaks first and Bob has to output $g(x,y)$. Define $D^{cc}(g)$ to be the minimum total
number of bits required to be transmitted for any classical deterministic protocol to compute
$g(x, y)$ on any input. Similarly, let $R_2^{cc}(g)$ denote the number of bits required in the worst
case for any classical randomised protocol to compute $g(x, y)$ with success probability at
least 2/3 on every input (the "2" refers to 2-sided error).

There are quantum generalisations of these models, in which Alice and Bob are allowed
to send and receive qubits (quantum bits) [29, 12]. We also allow them to share an arbitrary
prior entangled quantum state. The total number of qubits required to be transmitted
between Alice and Bob for them to compute $g$ exactly and with bounded error will be
denoted by $Q_E^{cc}(g)$ and $Q_2^{cc}(g)$, respectively. See [26] for a good introduction to quantum
communication complexity.

Two more restricted scenarios we consider are the one-way and simultaneous message
passing (SMP) models. In the one-way model, Alice sends a single message to Bob, who must
then use this message and his own input to evaluate $g(x, y)$. The bounded-error classical and
quantum complexities in this model will be denoted by $R_2^1(g)$, $Q_2^1(g)$, respectively. A more
restricted setting still is the SMP model. Here, Alice and Bob each send a single message
to a referee, who performs some computation on the messages and outputs $g(x,y)$. The
randomised communication complexity of $g$ in this model is denoted by $R_2^{\|,\mathrm{pub}}(g)$; note that,
in this paper, we assume that Alice and Bob are still allowed to share public randomness,
which the referee can also see.

1.1.3 Fourier analysis 

We will make heavy use of some basic ideas from the field of Fourier analysis on the group
$\mathbb{Z}_2^n$ (see [H] or [27] for excellent introductions to this area). Let $[n]$ denote the set $\{1, \dots, n\}$.
Then for any positive integer $n$, the set of $2^n$ parity functions $\chi_S : \{0,1\}^n \to \{1,-1\}$,
$\chi_S(x) = (-1)^{\sum_{i \in S} x_i}$, which are indexed by subsets $S \subseteq [n]$, are known as the characters of
the group $\mathbb{Z}_2^n$. Let $f : \{0,1\}^n \to \mathbb{R}$ be a function on the boolean cube. Then the Fourier
coefficients of $f$ are the set of coefficients, indexed by subsets $S \subseteq [n]$,

$$\hat{f}(S) = \frac{1}{2^n} \sum_{x \in \{0,1\}^n} \chi_S(x) f(x).$$

The $p$-norms of $f$ on the Fourier side are defined as

$$\|\hat{f}\|_p = \left( \sum_{S \subseteq [n]} |\hat{f}(S)|^p \right)^{1/p},$$

with the special cases $\|\hat{f}\|_0 = |\mathrm{supp}\,\hat{f}|$ (where $\mathrm{supp}\,\hat{f}$ denotes the set $\{S : \hat{f}(S) \ne 0\}$),
$\|\hat{f}\|_\infty = \max_S |\hat{f}(S)|$; of course, the former is not actually a norm. The norm $\|\hat{f}\|_1$ is known
as the spectral norm of $f$. Parseval's equality states that

$$\|\hat{f}\|_2^2 = \frac{1}{2^n} \sum_{x \in \{0,1\}^n} f(x)^2.$$

We frequently identify $n$-bit strings with their corresponding subsets of $[n]$ (that is, if $x$ is an
$n$-bit string, and $S$ is the subset of $[n]$ whose characteristic vector is $x$, then $\hat{f}(x) = \hat{f}(S)$).
The notation $\hat{f}^{\Delta T}$ denotes the function whose Fourier coefficients are all shifted by $T$:
$\hat{f}^{\Delta T}(S) = \hat{f}(S \Delta T)$, with $S \Delta T$ denoting the symmetric difference of the sets $S$ and $T$.
Similarly define $f^{\oplus y}(x) = f(x \oplus y)$. One can easily verify that $\chi_{S \Delta T}(x) = \chi_S(x)\chi_T(x)$ for
any $S$, $T$, and similarly $\chi_S(x \oplus y) = \chi_S(x)\chi_S(y)$. The Fourier dimensionality of $f$, $\dim f$,
is the smallest $k$ such that the Fourier spectrum of $f$ lies in a $k$-dimensional subspace of
$\{0,1\}^n$. Finally, note that when we consider the Fourier transform of a boolean function $f$,
we will always assume that $f$ is given in the form $f : \{0,1\}^n \to \{1,-1\}$.
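As a concrete illustration of these definitions, the following minimal sketch computes Fourier coefficients by brute force and evaluates the sparsity, the spectral norm and the largest coefficient. The function names (`chi`, `fourier`, `norms`, `majority3`) are illustrative choices rather than notation from the text, and the example function, 3-bit majority in the $\pm 1$ convention, is chosen only for demonstration.

```python
from itertools import product

def chi(S, x):
    """Character chi_S(x) = (-1)^(sum of x_i over i in S); S and x are 0/1 tuples."""
    return -1 if sum(s & b for s, b in zip(S, x)) % 2 else 1

def fourier(f, n):
    """Map each subset S (as a 0/1 tuple) to 2^-n * sum_x chi_S(x) * f(x)."""
    cube = list(product((0, 1), repeat=n))
    return {S: sum(chi(S, x) * f(x) for x in cube) / 2 ** n for S in cube}

def norms(coeffs):
    """Return (number of nonzero coefficients, sum of |coeff|, max |coeff|)."""
    vals = [abs(c) for c in coeffs.values()]
    return sum(v > 1e-12 for v in vals), sum(vals), max(vals)

def majority3(x):
    """3-bit majority in the +1/-1 convention (-1 when at least two bits are set)."""
    return -1 if sum(x) >= 2 else 1

coeffs = fourier(majority3, 3)
sparsity, spectral_norm, max_coeff = norms(coeffs)
# Parseval: the sum of squared coefficients equals the mean of f(x)^2.
parseval_sum = sum(c * c for c in coeffs.values())
```

For a $\pm 1$-valued function the Parseval sum is 1, which gives a quick sanity check on any implementation of the transform.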
1.1.4 Linear threshold functions



We give some assumptions and definitions related to LTFs. Let $f$ be an LTF as in eqn.
(1). In general, the weights $\{w_i\}$ may be negative, and hence $f$ may not be monotone but
only locally monotone (or unate). However, for the purposes of understanding query and
communication complexity, it suffices to assume that the weights are indeed positive, as this
may be simulated by local complementation of the individual bits. We also assume that the
weights are given in non-increasing order, i.e. $w_1 \ge w_2 \ge \dots \ge w_n$. Define $m_j$, where $j = 0$
or $j = 1$, as

$$m_j = \min_{x : f(x) = j} \left| \theta - \sum_{i=1}^n w_i x_i \right|,$$

and define the margin of $f$ as $m = \min\{m_0, m_1\}$. We assume that there is no $x$ such that
$\sum_{i=1}^n w_i x_i = \theta$, so the margin is strictly positive.
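The margin can be computed by brute force for small examples. The sketch below rests on an assumption: it takes $m_j$ to be the smallest distance between the weighted sum and the threshold over inputs on which the LTF takes the value $j$, which is the reading suggested by the surrounding text. The function name `margins` and the example weights are hypothetical.

```python
from itertools import product

def margins(weights, theta):
    """Return (m0, m1, m) for the LTF with the given weights and threshold.

    Assumption: m_j is the smallest distance |sum_i w_i x_i - theta| over
    inputs x on which the LTF takes the value j.
    """
    best = {0: float("inf"), 1: float("inf")}
    for x in product((0, 1), repeat=len(weights)):
        total = sum(w * b for w, b in zip(weights, x))
        value = 1 if total > theta else 0
        best[value] = min(best[value], abs(total - theta))
    return best[0], best[1], min(best.values())

# Example: weights (3, 2, 1) with threshold 2.5; no input sits on the threshold.
m0, m1, m = margins((3, 2, 1), 2.5)
```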

1.2 Statement of results 

Now we are equipped with definitions, the main results that we obtain can be stated
concisely as follows.

• Section 2.1 If $g$ is an XOR function, then $D^{cc,1}(g) = Q_E^{cc,1}(g) = \dim f$.

• Section 2.2 For any positive integer $m$, there is an XOR function $g$ such that
$D^{cc}(g) = O(m)$, but $Q_2^1(g) = \Omega(2^m)$.

• Section 2.3 For any XOR function $g$, $D^{cc}(g) = O(Q_E^{cc}(g))$, if the following conjecture
holds: For any boolean function $f$, there exists a subset $T \subseteq [n]$ such that
$|\mathrm{supp}(\hat{f}) \cap \mathrm{supp}(\hat{f}^{\Delta T})| \ge K \|\hat{f}\|_0$, for some constant $0 < K < 1$.

• Section 3 Let $g(x,y) = f(x \oplus y)$ be an XOR function. Then $R_2^{\|,\mathrm{pub}}(g) = O(\|\hat{f}\|_1^2)$,
and also $R_2^{\|,\mathrm{pub}}(g) = O(\log(2^{n-1}(1 - \|\hat{f}\|_\infty)))$. The former result is a special case of a
theorem of Grolmusz [7]; we give a simplified proof.

• Section 4 Let $g(x,y) = f(x \oplus y)$ be an XOR function. If $f$ is monotone, then
$D^{cc}(g) = O(Q_E^{cc}(g)^2)$. If $f$ is an LTF and depends on $n$ bits, then $Q_E^{cc}(g) = \Omega(n)$.

• Section 4.2 Let $g(x,y) = f(x \oplus y)$ be an XOR function where $f$ is an LTF with
margin $m$ and threshold $\theta$. Then $R_2^{\|,\mathrm{pub}}(g) = O((\theta/m)^2)$.

We now turn to proving these results. 

2 Communication complexity of general XOR functions 

2.1 Deterministic one-way communication complexity 

We begin by noting that the deterministic one-way communication complexity of all XOR 
functions has a simple characterisation. 
Proposition 1. Let $g(x,y) = f(x \oplus y)$ be an XOR function. Then

$$D^{cc,1}(g) = Q_E^{cc,1}(g) = \dim f.$$

Proof. It is well-known [15] that $D^{cc,1}(g) = \lceil \log_2 \mathrm{nrows}(g) \rceil$, where $\mathrm{nrows}(g)$ denotes the
number of distinct rows in the communication matrix of $g$, and Klauck showed that the
same is true for deterministic one-way quantum communication [9]. Now it holds that

$$\mathrm{nrows}(g) = \left| \bigcup_{x} \{ f^{\oplus x} \} \right|
= \sum_{x \in \{0,1\}^n} \frac{1}{|\{y : f^{\oplus y} = f^{\oplus x}\}|}
= \sum_{x \in \{0,1\}^n} \frac{1}{|\{y : f^{\oplus y} = f\}|}
= \frac{2^n}{|\{y : f^{\oplus y} = f\}|}
= \frac{2^n}{|\{y : \langle y, s \rangle = 0 \ \forall s \in \mathrm{supp}\,\hat{f}\}|}
= 2^{\dim f},$$

where in the penultimate equality we use the fact (which follows easily from Fourier duality)
that $f = f^{\oplus y}$ if and only if the function $\chi_y \cdot \hat{f} = \hat{f}$. This implies that there is no $s \in \mathrm{supp}\,\hat{f}$
such that $\langle y, s \rangle = 1$, where the inner product is taken over $\mathbb{F}_2$. □
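The relationship between the number of distinct rows and the Fourier dimension can be checked by brute force on small functions. The following is a minimal sketch under stated assumptions: the helper names (`distinct_rows`, `fourier_support`, `f2_rank`, `fourier_dimension`) are illustrative, $f$ is given with values in $\{0,1\}$ and converted to $\pm 1$ before taking the transform, and the Fourier dimension is computed as the rank over $\mathbb{F}_2$ of the support vectors.

```python
from itertools import product

def xor(a, b):
    return tuple(s ^ t for s, t in zip(a, b))

def distinct_rows(f, n):
    """Number of distinct rows of the matrix M[x][y] = f(x XOR y)."""
    cube = list(product((0, 1), repeat=n))
    return len({tuple(f(xor(x, y)) for y in cube) for x in cube})

def fourier_support(f, n):
    """Subsets S (as 0/1 tuples) with nonzero Fourier coefficient of (-1)^f."""
    cube = list(product((0, 1), repeat=n))
    sign = lambda S, x: -1 if sum(s & b for s, b in zip(S, x)) % 2 else 1
    return [S for S in cube
            if sum(sign(S, x) * (1 - 2 * f(x)) for x in cube) != 0]

def f2_rank(vectors):
    """Rank over F_2 of a list of 0/1 tuples (XOR basis on bitmasks)."""
    basis = []
    for v in vectors:
        mask = int("".join(map(str, v)), 2)
        for b in basis:
            mask = min(mask, mask ^ b)  # clears b's leading bit if it is set
        if mask:
            basis.append(mask)
    return len(basis)

def fourier_dimension(f, n):
    return f2_rank(fourier_support(f, n))

examples = {
    "parity of bits 0,1": lambda z: z[0] ^ z[1],
    "AND of bits 0,1": lambda z: z[0] & z[1],
    "3-bit majority": lambda z: int(sum(z) >= 2),
}
results = {name: (distinct_rows(f, 3), fourier_dimension(f, 3))
           for name, f in examples.items()}
```

In each example the number of distinct rows is 2 raised to the Fourier dimension, so the base-2 logarithm of the row count and the dimension coincide.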



2.2 Separation between one-way and two-way communication complexity 

We now establish that there can be an exponential gap between the one-way (quantum,
bounded-error) and two-way (classical, deterministic) communication complexity of XOR
functions^3, using a VC-dimension argument. The VC-dimension of a matrix $M$, VC-dim($M$),
is the largest $k$ such that there exists a $2^k \times k$ submatrix $M'$ of $M$ whose rows are all
distinct. It was previously shown by Klauck [10] that VC-dimension gives a lower bound on
bounded-error quantum communication complexity:

Theorem 2 (Klauck [10]). Let $M$ be the communication matrix of some function $f$. Then
$Q_2^1(f) = \Omega(\text{VC-dim}(M))$.

We have the following proposition. 

Proposition 3. Let $x$ be an $(m + 2^m)$-bit string divided into an $m$-bit "address" register $a$,
and a $2^m$-bit "data" register $d$. Let $f(x)$ be the addressing function, which returns the data
bit at a given address: $f(x) = d_a$. Finally, let $g$ be the XOR function $g(x,y) = f(x \oplus y)$.
Then $D^{cc}(g) = O(m)$, but $Q_2^1(g) = \Omega(2^m)$.

Proof. For the upper bound, note that $D(f) = m + 1$: a decision tree for $f$ can just evaluate
the $m$ address bits, followed by the one relevant data bit. For the lower bound, we will
show that VC-dim($M$) $\ge 2^m$, with the result following from Theorem 2. Let $S_x$ be the set
$\{(a, 0^{2^m})\}$ for $a \in \{0,1\}^m$, and let $S_y$ be the set $\{(0^m, d)\}$ for $d \in \{0,1\}^{2^m}$. For all pairs of
$2^m$-bit strings $d \ne d'$, there exists an $a$ such that $d_a \ne d'_a$. Thus, for all $y \ne y' \in S_y$, there
is an $x \in S_x$ such that $f(x \oplus y) \ne f(x \oplus y')$, implying that VC-dim($M$) $\ge 2^m$. □

^3 Note that this is a stronger separation than between quantum and randomised communication
complexity.






Many of the most efficient known communication protocols for XOR functions require 
only one-way communication [HI [23], and indeed it was left as an open question in [24] 
whether all symmetric functions could be computed optimally using a one-way protocol. 
The above proposition implies that this cannot be true in a more general setting. 



2.3 Parity decision trees and Fourier spectra 

We turn to the question of finding classical upper bounds, and quantum lower bounds, on the
two-way deterministic communication complexity of XOR functions. This is where Fourier
analysis becomes very useful, in particular because of the following natural observation,
which appears to have first been written down by Shi and Zhang [24].

Observation 4. Let $g(x,y) = f(x \oplus y)$ be an XOR function. Then $D^{cc}(g) \ge \log_2 \|\hat{f}\|_0$ and
$Q_E^{cc}(g) \ge \frac{1}{2} \log_2 \|\hat{f}\|_0$.



Proof. Assume $f$ is a function on $n$ bits, and let $M$ be the communication matrix of $g$.
Then it is easy to see that $M$ is diagonalised by the Fourier transform over $\mathbb{Z}_2^n$, and the
eigenvalues of $M$ are given by $f$'s Fourier coefficients, scaled appropriately. Indeed, letting
$F$ denote the matrix of this Fourier transform in the usual basis indexed by $n$-bit strings,
$F_{xy} = (-1)^{\langle x, y \rangle}$ (with the inner product being taken over $\mathbb{F}_2$), we have

$$(FMF)_{xy} = \sum_{u,v \in \{0,1\}^n} F_{xu} M_{uv} F_{vy} = \sum_{u,v \in \{0,1\}^n} (-1)^{\langle x, u \rangle + \langle y, v \rangle} f(u \oplus v),$$

which is equal to $2^{2n} \hat{f}(x)$ if $x = y$, and 0 otherwise. So the rank of $M$ is equal to $\|\hat{f}\|_0$. The
observation now follows from known results lower bounding the classical [16] and quantum
[2, 17] communication complexity of a function by the log of the rank of its communication
matrix. □
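The diagonalisation claim can be verified exactly on small examples using integer arithmetic only. The sketch below is an illustration under stated assumptions: it uses the unnormalised matrix $F_{xy} = (-1)^{\langle x,y \rangle}$ and the normalisation $\hat{f}(S) = 2^{-n}\sum_x \chi_S(x) f(x)$, so the diagonal entries of $FMF$ come out as $2^{2n}\hat{f}(x)$; the function name `check_diagonalisation` is hypothetical.

```python
from itertools import product

def check_diagonalisation(f, n):
    """Check (F M F)[x][y] == 2^(2n) * f_hat(x) * [x == y]; return rank of M.

    M[u][v] = f(u XOR v) and F[x][y] = (-1)^<x,y>, so all entries are integers.
    """
    cube = list(product((0, 1), repeat=n))
    ip = lambda a, b: sum(s & t for s, t in zip(a, b)) % 2
    xor = lambda a, b: tuple(s ^ t for s, t in zip(a, b))
    size = len(cube)
    # scaled[x] = 2^n * f_hat(x), kept as an exact integer
    scaled = {x: sum((-1) ** ip(x, u) * f(u) for u in cube) for x in cube}
    for x in cube:
        for y in cube:
            entry = sum((-1) ** (ip(x, u) + ip(y, v)) * f(xor(u, v))
                        for u in cube for v in cube)
            assert entry == (size * scaled[x] if x == y else 0)
    # F is invertible, so rank(M) is the number of nonzero diagonal entries.
    return sum(1 for x in cube if scaled[x] != 0)

majority3 = lambda z: -1 if sum(z) >= 2 else 1
rank = check_diagonalisation(majority3, 3)
```

For 3-bit majority the rank is 4, matching the number of nonzero Fourier coefficients of that function.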

In the other direction, the following observation gives a natural way of finding upper
bounds on the deterministic communication complexity of XOR functions.

Observation 5. Let $g(x,y) = f(x \oplus y)$ be an XOR function. Then $D^{cc}(g) \le 2D^\oplus(f)$.

Proof. Given a parity decision tree for $f$ that uses at most $D^\oplus(f)$ queries on any input,
a communication protocol for $g$ can be obtained as follows. Each query to a subset $S$ of
the bits of the string $x \oplus y$ can be simulated by Alice sending the parity $\bigoplus_{i \in S} x_i$ to Bob,
who reciprocates by sending her $\bigoplus_{i \in S} y_i$. This clearly enables each of them to compute
the parity of the bits of $x \oplus y$ in $S$, at a cost of 2 bits of communication per query. □
Therefore, it would suffice to prove the following conjecture to show that the quantum and
classical communication complexities of XOR functions are polynomially related.

Conjecture 6. Let $f : \{0,1\}^n \to \{1,-1\}$ be a boolean function. Then

$$D^\oplus(f) = O(\mathrm{polylog}(\|\hat{f}\|_0)).$$

It appears to be fairly difficult to reason about parity decision trees. We now give a
conjecture which is merely about the structure of the Fourier spectrum of boolean functions
and which, if true, would imply Conjecture 6.

Conjecture 7. Let $f : \{0,1\}^n \to \{1,-1\}$ be a boolean function. Then there exist universal
constants $C$, $0 < K < 1$ such that, if $\|\hat{f}\|_0 > C$, there exists a subset $T \subseteq [n]$ such that
$|\mathrm{supp}(\hat{f}) \cap \mathrm{supp}(\hat{f}^{\Delta T})| \ge K \|\hat{f}\|_0$.

In order to show that Conjecture 7 does indeed imply Conjecture 6, we will need the
following lemma.

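Lemma 8. Let $f : \{0,1\}^n \to \mathbb{R}$ be some function on the boolean cube, and let $T \subseteq [n]$ be
arbitrary. Define the function $g$ by

$$g(x) = \begin{cases} f(x) & [\chi_T(x) = r] \\ f(x \oplus t) & [\chi_T(x) = -r] \end{cases}$$

for some $t$ such that $\chi_T(t) = -1$, and some $r = \pm 1$. Then $g(x) = f(x)$ wherever $\chi_T(x) = r$,
and for all $S$, $\hat{g}(S) = \frac{1}{2}(1 + \chi_S(t))(\hat{f}(S) + r \hat{f}(S \Delta T))$. In particular, for all $S$ such that
$\chi_S(t) = -1$, $\hat{g}(S) = 0$.

Proof. The fact that $g(x) = f(x)$ wherever $\chi_T(x) = r$ is immediate; we now prove the
second claim. We have

$$\begin{aligned}
\hat{g}(S) &= \frac{1}{2^n} \left( \sum_{x, \chi_T(x) = r} f(x) \chi_S(x) + \sum_{x, \chi_T(x) = -r} f(x \oplus t) \chi_S(x) \right) \\
&= \frac{1}{2^{n+1}} \left( \sum_{x \in \{0,1\}^n} (1 + r\chi_T(x)) f(x) \chi_S(x) + \sum_{x \in \{0,1\}^n} (1 - r\chi_T(x)) f(x \oplus t) \chi_S(x) \right) \\
&= \frac{1}{2^{n+1}} \left( \sum_{x \in \{0,1\}^n} (1 + r\chi_T(x)) f(x) \chi_S(x) + \sum_{x \in \{0,1\}^n} \chi_S(t) (1 + r\chi_T(x)) f(x) \chi_S(x) \right) \\
&= \frac{1 + \chi_S(t)}{2^{n+1}} \sum_{x \in \{0,1\}^n} (1 + r\chi_T(x)) f(x) \chi_S(x) \\
&= \frac{1 + \chi_S(t)}{2} \left( \hat{f}(S) + r \hat{f}(S \Delta T) \right),
\end{aligned}$$

which is clearly zero wherever $\chi_S(t) = -1$. □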
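The identity for the Fourier coefficients of $g$ can be checked numerically. The sketch below is illustrative only: the names `fold` and `lemma_holds` are hypothetical, subsets and bit strings are both represented as 0/1 tuples, and the real-valued test function is an arbitrary table chosen for the demonstration.

```python
from itertools import product

def chi(S, x):
    return -1 if sum(s & b for s, b in zip(S, x)) % 2 else 1

def xor(a, b):
    return tuple(s ^ t for s, t in zip(a, b))

def fourier(f, n):
    cube = list(product((0, 1), repeat=n))
    return {S: sum(chi(S, x) * f(x) for x in cube) / 2 ** n for S in cube}

def fold(f, T, t, r):
    """g(x) = f(x) where chi_T(x) == r, and f(x XOR t) elsewhere."""
    assert chi(T, t) == -1
    return lambda x: f(x) if chi(T, x) == r else f(xor(x, t))

def lemma_holds(f, n, T, t, r):
    """Compare g_hat(S) with (1 + chi_S(t)) / 2 * (f_hat(S) + r * f_hat(S delta T))."""
    f_hat, g_hat = fourier(f, n), fourier(fold(f, T, t, r), n)
    return all(
        abs(g_hat[S] - 0.5 * (1 + chi(S, t)) * (f_hat[S] + r * f_hat[xor(S, T)])) < 1e-9
        for S in f_hat)

# Arbitrary real-valued test function on 3 bits (values chosen for illustration).
TABLE = [3, -1, 4, 1, -5, 9, 2, -6]
f = lambda x: TABLE[4 * x[0] + 2 * x[1] + x[2]]
checks = [lemma_holds(f, 3, (1, 1, 0), (1, 0, 0), r) for r in (1, -1)]
```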

Now consider an algorithm which attempts to evaluate $f(x)$ for some unknown input
$x$ by making a query to the parity of the bits in a subset $T \subseteq [n]$, which is equivalent to
querying the function $\chi_T(x)$. Given the knowledge that $\chi_T(x) = r$, for $r = \pm 1$, in order
to evaluate $f(x)$, it suffices to evaluate $g(x)$ for any function $g$ of our choice, as long as
$g(x) = f(x)$ wherever $\chi_T(x) = r$. That is, we can replace $f$ with $g$.

If we pick $g$ according to the procedure of Lemma 8, then as $\chi_T(t) = -1$, for each $S$
either $\hat{g}(S) = 0$, or $\hat{g}(S \Delta T) = 0$. This implies that whatever the value of $r$, the number of
nonzero Fourier coefficients of $g$ is upper bounded by half of the number of subsets $S$ where
either $\hat{f}(S) \ne 0$ or $\hat{f}(S \Delta T) \ne 0$; this quantity can be written down concisely as

$$\frac{1}{2} \left| \mathrm{supp}(\hat{f}) \cup \mathrm{supp}(\hat{f}^{\Delta T}) \right| = \|\hat{f}\|_0 - \frac{1}{2} \left| \mathrm{supp}(\hat{f}) \cap \mathrm{supp}(\hat{f}^{\Delta T}) \right|.$$

So, if there exists a subset $T$ such that $|\mathrm{supp}(\hat{f}) \cap \mathrm{supp}(\hat{f}^{\Delta T})| \ge K \|\hat{f}\|_0$, for some constant
$0 < K < 1$, then $\|\hat{g}\|_0$ will be at most a constant fraction of $\|\hat{f}\|_0$. If there exists such
a subset for all boolean functions, then after repeating this procedure $O(\log \|\hat{f}\|_0)$ times
(querying the parity of the bits in the best subset each time), $f$ would be reduced to a
constant function. This would thus imply that $D^\oplus(f) = O(\log \|\hat{f}\|_0)$.



3 Randomised protocols for XOR functions 

In this section we discuss randomised classical protocols for computing general XOR
functions. The first protocol we give is efficient for functions whose spectral norm is low, while
the second is efficient for functions which are close to a parity function on some subset of
the bits. These protocols can be seen as two different generalisations of a protocol for
computing the equality function ($g(x, y) = 1 \Leftrightarrow x = y$), which satisfies both of these conditions.
We give a third (!) generalisation of this protocol in Section 4.2.



Proposition 9 (Grolmusz [7]). Let $g(x,y) = f(x \oplus y)$ be an XOR function with
$f : \{0,1\}^n \to \{1,-1\}$. Then $R_2^{\|,\mathrm{pub}}(g) = O(\|\hat{f}\|_1^2)$.

Proof. We give a variant of a protocol of Kremer, Nisan and Ron for computing the
inner product of two vectors, which will achieve the specified complexity. Using their shared
randomness, Alice and Bob pick $k$ subsets $\{S_i\}$ from the family of subsets of $[n]$, for some $k$
to be determined, where the set $S$ is picked with probability $|\hat{f}(S)| / \|\hat{f}\|_1$. For each subset
$S_i$, Alice sends the referee the bit $\chi_{S_i}(x)$, and Bob sends the referee the bit $\chi_{S_i}(y)$. The
referee uses these bits to compute

$$\sum_{i=1}^k \chi_{S_i}(x) \chi_{S_i}(y) \,\mathrm{sgn}(\hat{f}(S_i)) = \sum_{i=1}^k \chi_{S_i}(x \oplus y) \,\mathrm{sgn}(\hat{f}(S_i)),$$

and outputs 1 if the result is positive, and $-1$ if negative. To see correctness of this
protocol, note that for each $i$, $\chi_{S_i}(x \oplus y)\,\mathrm{sgn}(\hat{f}(S_i))$ is a sample from a random variable
whose expectation is

$$\frac{1}{\|\hat{f}\|_1} \sum_{S \subseteq [n]} \chi_S(x \oplus y) \hat{f}(S) = \frac{f(x \oplus y)}{\|\hat{f}\|_1}.$$

Standard Chernoff bound arguments thus give that the number of samples $k$ required to
determine whether $f(x \oplus y) > 0$, with a constant probability of success, is $O(\|\hat{f}\|_1^2)$. □

^4 This is a special case of a result of Grolmusz [7]; we give a simplified proof.
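The sampling protocol can be simulated directly for small functions. The following is a minimal sketch under stated assumptions: a seeded `random.Random` instance stands in for the shared random string, Fourier coefficients are computed by brute force, and the names `referee_output` and `majority3`, as well as the sample count 201, are illustrative choices rather than values from the text.

```python
import random
from itertools import product

def chi(S, x):
    return -1 if sum(s & b for s, b in zip(S, x)) % 2 else 1

def fourier(f, n):
    cube = list(product((0, 1), repeat=n))
    return {S: sum(chi(S, x) * f(x) for x in cube) / 2 ** n for S in cube}

def referee_output(f_hat, x, y, k, rng):
    """One run of the sampling protocol with k shared random subsets."""
    subsets = list(f_hat)
    weights = [abs(f_hat[S]) for S in subsets]      # Pr[S] = |f_hat(S)| / ||f_hat||_1
    total = 0
    for S in rng.choices(subsets, weights=weights, k=k):
        alice_bit, bob_bit = chi(S, x), chi(S, y)   # one bit from each player
        total += alice_bit * bob_bit * (1 if f_hat[S] > 0 else -1)
    return 1 if total > 0 else -1

majority3 = lambda z: -1 if sum(z) >= 2 else 1
f_hat = fourier(majority3, 3)
rng = random.Random(0)                               # stands in for the shared random string
cube = list(product((0, 1), repeat=3))
errors = sum(
    referee_output(f_hat, x, y, 201, rng) != majority3(tuple(a ^ b for a, b in zip(x, y)))
    for x in cube for y in cube)
```

For 3-bit majority the spectral norm is 2, so each sample agrees with $f(x \oplus y)$ with probability 3/4, and a few hundred samples make an error on any of the 64 input pairs extremely unlikely.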

One can use the previous example of the addressing function to show that the above
protocol is close to optimal in terms of its dependence on the spectral norm, even among
all one-way quantum protocols. Indeed, the addressing function with an $m$-bit address
register has spectral norm $2^m$, and by Proposition 3 has one-way quantum communication
complexity $\Omega(2^m)$.

The second protocol rests on the following lemma. 

Lemma 10. Let $f : \{0,1\}^n \to \{1,-1\}$ and $f' : \{0,1\}^n \to \{1,-1\}$ be boolean functions that
disagree on at most $m$ inputs, and let $g(x,y) = f(x \oplus y)$ and $g'(x,y) = f'(x \oplus y)$ be the
corresponding XOR functions. Then $R_2^{\|,\mathrm{pub}}(g') \le R_2^{\|,\mathrm{pub}}(g) + O(\log m)$.

Proof. Let $S$ be the set of inputs $z$ such that $f(z) \ne f'(z)$. We give a protocol in the SMP
model with shared randomness that determines whether $(x \oplus y) \in S$, using $O(\log |S|)$ bits
of communication. This clearly implies the lemma: to get a protocol for $g'$, it suffices to
carry out the protocol for $g$, then check whether $(x \oplus y) \in S$, and if so, negate the result. In
order to do this check, we use a simple generalisation of a well-known protocol for testing
equality [15], which was also used by Gavinsky, Kempe and de Wolf in their protocol for
computing the Hamming distance. We give it explicitly for completeness.

Using their shared randomness, Alice and Bob create $k$ $n$-bit strings $\{r_1, \dots, r_k\}$, for
some $k$ to be determined. Alice sends the referee the $k$-bit string $a = (\langle x, r_1 \rangle, \dots, \langle x, r_k \rangle)$
that lists their inner products with $x$ over $\mathbb{F}_2$, and Bob does the same with the string
$b = (\langle y, r_1 \rangle, \dots, \langle y, r_k \rangle)$. The referee outputs 1 if there is some $z \in S$ such that $a_i \oplus b_i = \langle z, r_i \rangle$
for all $i$, and otherwise outputs $-1$. We have

$$\Pr[a_i \oplus b_i = \langle z, r_i \rangle] = \Pr[\langle x \oplus y, r_i \rangle = \langle z, r_i \rangle],$$

which will equal 1 if $x \oplus y = z$, and 1/2 otherwise. Thus the probability, for any given $z \in S$
with $x \oplus y \ne z$, that the referee incorrectly outputs 1 is $1/2^k$. Using a union bound over all
$z \in S$, it suffices to take $k = O(\log |S|)$ to achieve a constant probability of success. □
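The membership test at the heart of this proof is short enough to simulate. The sketch below is a hedged illustration: a seeded `random.Random` instance plays the role of the shared randomness, the function returns a boolean rather than $\pm 1$, and the names `inner` and `xor_in_set`, the set `S` and the choice $k = 40$ are all hypothetical.

```python
import random

def inner(a, b):
    """Inner product of two 0/1 tuples over F_2."""
    return sum(s & t for s, t in zip(a, b)) % 2

def xor_in_set(x, y, S, k, rng):
    """Referee's decision on whether x XOR y lies in S, from two k-bit messages."""
    n = len(x)
    rs = [tuple(rng.randrange(2) for _ in range(n)) for _ in range(k)]  # shared strings
    a = [inner(x, r) for r in rs]   # Alice's k-bit message
    b = [inner(y, r) for r in rs]   # Bob's k-bit message
    # The referee accepts if some z in S is consistent with every parity check.
    return any(all((ai ^ bi) == inner(z, r) for ai, bi, r in zip(a, b, rs))
               for z in S)

rng = random.Random(1)
S = [(1, 0, 1, 1), (0, 1, 1, 0), (1, 1, 1, 1)]
x = (1, 1, 0, 0)
inside = xor_in_set(x, (0, 1, 1, 1), S, 40, rng)     # x XOR y = (1,0,1,1), in S
outside = xor_in_set(x, (1, 1, 0, 1), S, 40, rng)    # x XOR y = (0,0,0,1), not in S
```

The error is one-sided: when $x \oplus y \in S$ the referee always accepts, and otherwise a false accept happens with probability at most $|S|/2^k$.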

Note that the above lemma still holds for stronger models of communication (e.g. $R_2^1$,
$R_2^{cc}$), and that a similar result does not apparently hold for the communication complexity
of general functions. It is now straightforward to see the following proposition.

Proposition 11. Let $g(x,y) = f(x \oplus y)$ be an XOR function with $f : \{0,1\}^n \to \{1,-1\}$.
Assume that there is some parity function $\chi_T$ such that $f$ disagrees with $\chi_T$ on $m$ inputs.
Then $R_2^{\|,\mathrm{pub}}(g) = O(\log m)$. In other words,

$$R_2^{\|,\mathrm{pub}}(g) = O(\log(2^{n-1}(1 - \|\hat{f}\|_\infty))).$$

As a special case, if $f$ takes the value 1 (or the value $-1$) on at most $m$ inputs,
$R_2^{\|,\mathrm{pub}}(g) = O(\log m)$.

Proof. It is clear that any function $g(x, y) = \chi_T(x \oplus y)$, with $T$ nonempty, has $R_2^{\|,\mathrm{pub}}(g) = 2$
(by a protocol where Alice and Bob each send the referee the parity of the bits of their
inputs in the set $T$). The result follows from Lemma 10. □

4 Communication complexity of monotone functions 

We now show that the two-way deterministic communication complexity of monotone XOR
functions is almost determined by the rank. We will need the following lemma relating
sensitivity and degree over $\mathbb{F}_2$; the proof is essentially the same as a previously known
result relating sensitivity and degree over $\mathbb{R}$ p].

Lemma 12. Let $f : \{0,1\}^n \to \{0,1\}$ be a monotone boolean function. Then $s(f) \le \deg_2(f)$.

Proof. It is well known (see [1, Lemma 3], for example) that the degree of $f$ over $\mathbb{F}_2$ is
precisely the size of the largest subfunction of $f$ that takes the value 1 on an odd number
of inputs. Now consider a point $x$ that achieves maximal sensitivity, i.e. $f(y) \ne f(x)$ for
exactly $s(f)$ neighbours $y$ of $x$. Assume wlog $f(x) = 1$. Now, by the monotonicity of
$f$, all the points $z \ne x$ in the subcube traced out by $x$ and all these neighbours $y$ must have
$f(z) = 0$ (of the points in this subcube, $x$ must have maximal Hamming weight; for each $y$
neighbouring $x$, $f(y) = 0$; and all other points in this subcube must have lower Hamming
weight). So $f$ takes the value 1 on exactly one point in this dimension $s(f)$ subcube, so
$\deg_2(f) \ge s(f)$. □
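The inequality between sensitivity and degree over $\mathbb{F}_2$ can be confirmed exhaustively for small $n$. The sketch below enumerates all monotone functions on 3 bits, represented as truth-table dictionaries; the helper names and the choice $n = 3$ are assumptions made for illustration, and the degree is obtained from the standard Moebius-inversion formula for the coefficients of the $\mathbb{F}_2$ polynomial.

```python
from itertools import product

N = 3
CUBE = list(product((0, 1), repeat=N))

def below(x, y):
    """True if x <= y coordinate-wise."""
    return all(a <= b for a, b in zip(x, y))

def is_monotone(f):
    return all(f[x] <= f[y] for x in CUBE for y in CUBE if below(x, y))

def sensitivity(f):
    flip = lambda x, i: x[:i] + (1 - x[i],) + x[i + 1:]
    return max(sum(f[flip(x, i)] != f[x] for i in range(N)) for x in CUBE)

def deg2(f):
    """Degree of the multilinear polynomial for f over F_2 (Moebius inversion)."""
    # The monomial indexed by S appears iff f sums to an odd number below S.
    degrees = [sum(S) for S in CUBE
               if sum(f[x] for x in CUBE if below(x, S)) % 2]
    return max(degrees, default=0)

tables = (dict(zip(CUBE, bits)) for bits in product((0, 1), repeat=len(CUBE)))
monotone = [f for f in tables if is_monotone(f)]
violations = [f for f in monotone if sensitivity(f) > deg2(f)]
```

The list `violations` collects any monotone function whose sensitivity exceeds its degree over $\mathbb{F}_2$, so it is empty when the lemma holds on this domain.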

Proposition 13. Let $f : \{0,1\}^n \to \{0,1\}$ be a monotone boolean function. Define
$g(x, y) = f(x \oplus y)$. Then $D^{cc}(g) \le 4(\log_2 \|\hat{f}\|_0)^2 = 4(\log_2 \mathrm{rank}\, g)^2$.

Proof. The result follows from

$$D^{cc}(g) \le 2D(f) \le 4s(f)^2 \le 4\deg_2(f)^2 \le 4(\log_2 \|\hat{f}\|_0)^2.$$

The inequalities are proven in order, as follows. For the first, if Alice and Bob have a
decision tree for $f$, they can use it to compute $g$ with only an overhead of a factor of 2 [15].
The second is proven as Corollary 5 of [3], while the third inequality follows from Lemma
12. The fourth is Lemma 3 of [1] (or see [6, eqn. (2)]). □

This proposition immediately implies the following corollary. 

Corollary 14. Let $f : \{0,1\}^n \to \{0,1\}$ be a monotone boolean function. Define
$g(x,y) = f(x \oplus y)$. Then $D^{cc}(g) \le 16\, Q_E^{cc}(g)^2$.

4.1 Lower bounds on communication complexity of LTFs 

We turn to a class of XOR functions that is more specialised still: those based on linear
threshold functions. We will see that the deterministic communication complexity of these
functions is always $\Omega(n)$. We will need the following lemma, which does not appear to have
been noted previously in the literature.
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Lemma 15. Let f be an LTF that depends on n hits. Then s{f) > \{n + l)/2], and this 
result is best possible. 

Proof. Write the weights in non-increasing order, $w_1 \ge \cdots \ge w_n$. Then, as $f$ depends on
all $n$ variables, there exists an assignment to the bits $x_1, \ldots, x_{n-1}$ such that

$$\sum_{i=1}^{n-1} w_i x_i + w_n \ge \theta,$$

but

$$\sum_{i=1}^{n-1} w_i x_i < \theta.$$

Call this assignment $(z_1, \ldots, z_{n-1})$. As $w_n$ is the smallest of the weights, flipping any of
the bits of the string $z^0 = (z_1, \ldots, z_{n-1}, 0)$ from 0 to 1 will change the value of $f$, as will
flipping any of the bits of the string $z^1 = (z_1, \ldots, z_{n-1}, 1)$ from 1 to 0. Thus $s(f)$ is lower
bounded by the maximum of $\{n - |z^0|, |z^1|\}$, which is at least $\lceil (n+1)/2 \rceil$. The Majority
function has sensitivity $\lceil (n+1)/2 \rceil$ and demonstrates that this result is best possible. □
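The bound of Lemma 15 is easy to test exhaustively for small $n$. The following Python
sketch is illustrative, not part of the original argument: the helper names are hypothetical,
and it assumes the convention $f(x) = 1$ iff $\sum_i w_i x_i \ge \theta$, with Majority modelled as
unit weights and threshold $n/2$.

```python
from itertools import product
from math import ceil

def ltf(weights, theta):
    """Return f with f(x) = 1 iff sum_i w_i * x_i >= theta (assumed convention)."""
    return lambda x: int(sum(w * b for w, b in zip(weights, x)) >= theta)

def sensitivity(f, n):
    """Maximum over inputs x of the number of single-bit flips that change f(x)."""
    best = 0
    for x in product((0, 1), repeat=n):
        flips = sum(f(x) != f(x[:i] + (1 - x[i],) + x[i + 1:]) for i in range(n))
        best = max(best, flips)
    return best

def lemma15_bound(n):
    """The lower bound ceil((n + 1) / 2) on the sensitivity of an n-bit LTF."""
    return ceil((n + 1) / 2)

# Majority on n bits meets the bound with equality for every small n tried here.
majority_sensitivities = {n: sensitivity(ltf([1] * n, n / 2), n) for n in range(1, 8)}

# A weighted example that depends on all three of its bits.
weighted = ltf([3, 2, 1], 3)
weighted_sensitivity = sensitivity(weighted, 3)
```

In this sketch Majority attains the bound exactly, while the weighted example with weights
(3, 2, 1) and threshold 3 has sensitivity 2, which also equals $\lceil (3+1)/2 \rceil$.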

Proposition 16. Let $f$ be an LTF that depends on $n$ bits, and set $g(x,y) = f(x \oplus y)$. Then
$D^{cc}(g) \ge \lceil (n+1)/2 \rceil$ and $Q_E^{cc}(g) \ge \lceil (n+1)/4 \rceil$.

Proof. In the proof of Proposition 13 it was shown that, if $f$ is monotone, $\log_2 \operatorname{rank}(g) \ge s(f)$.
The present proposition now follows from Lemma 15 and known results lower bounding
classical [16] and quantum [2, 17] communication complexity by the log of the rank of
$g$. □

4.2 Upper bounds on communication complexity of LTFs 

The final result of this paper is an upper bound on the randomised classical communication 
complexity of LTFs, derived by giving an explicit protocol for such functions in the SMP 
model with shared randomness. Formally, we have the following result. 

Proposition 17. Let $g(x,y) = f(x \oplus y)$, where $f$ is an LTF with threshold $\theta$ and margin
$m$. Then $R^{\|,\mathrm{pub}}(g) = O((\theta/m)^2)$.

Our protocol can be seen as a generalisation of Yao's protocol for the Hamming distance
function [30], which in turn can be understood as a generalisation of the well-known
constant-communication protocol for computing equality of two bit strings. It proceeds as
follows.

1. Alice and Bob use their shared randomness to generate $k = O((\theta/m)^2)$ $n$-bit strings
$r_1, \ldots, r_k$, where the $i$'th bit of each string $r_j$ is equal to 1 with probability $p_i$, for
some probabilities $\{p_i\}$ which will be determined later.

2. For each $j$, Alice and Bob each compute the bits $a_j = \langle r_j, x \rangle$ and $b_j = \langle r_j, y \rangle$
(respectively), where the inner product is taken over $\mathbb{F}_2$, and each send the resulting $k$ bits
to the referee.

3. The referee computes $s = \frac{1}{k} \sum_{j=1}^{k} (a_j \oplus b_j)$ and outputs 1 if

$$s > \tfrac{1}{2}\left(1 - \tfrac{1}{2}\left((1 - 1/\theta)^{\theta - m_0} + (1 - 1/\theta)^{\theta + m_1}\right)\right),$$

where $m_0$, $m_1$ are defined as in Section 1.1.4, and we assume that $m_0$, $m_1$, and $\theta$ are
all greater than 1, rescaling if necessary.
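The three steps above can be simulated directly. The Python sketch below is illustrative
rather than a definitive implementation: the function name and parameters are hypothetical,
it assumes non-negative weights and $\theta > 1$, and it fixes the bit probabilities to
$p_i = \frac{1}{2}(1 - (1 - 1/\theta)^{w_i})$, the choice that the analysis following the protocol arrives at.

```python
import random

def smp_ltf_protocol(weights, theta, m0, m1, x, y, k, rng):
    """Simulate one run of the shared-randomness SMP protocol; return the referee's bit."""
    base = 1.0 - 1.0 / theta
    # Assumed choice of bit probabilities: p_i = (1 - (1 - 1/theta)^{w_i}) / 2.
    probs = [(1.0 - base ** w) / 2.0 for w in weights]
    disagreements = 0
    for _ in range(k):
        # Step 1: one shared random string r_j, visible to both Alice and Bob.
        r = [int(rng.random() < p) for p in probs]
        # Step 2: each party sends a single bit, an inner product over F_2.
        a = sum(ri * xi for ri, xi in zip(r, x)) % 2
        b = sum(ri * yi for ri, yi in zip(r, y)) % 2
        disagreements += a ^ b
    # Step 3: the referee compares the empirical mean with the midpoint threshold.
    s = disagreements / k
    cutoff = 0.5 * (1.0 - 0.5 * (base ** (theta - m0) + base ** (theta + m1)))
    return int(s > cutoff)

# Hypothetical instance: 8 bits of weight 10 each and threshold 25, so every
# weighted sum is a multiple of 10 and the margins are m0 = m1 = 5.
weights = [10] * 8
x = [1, 1, 1, 0, 0, 0, 0, 0]
y_far = [0, 0, 0, 0, 0, 0, 0, 0]    # x XOR y has weighted sum 30 >= 25
y_near = [1, 0, 0, 0, 0, 0, 0, 0]   # x XOR y has weighted sum 20 < 25
rng = random.Random(0)
# k is chosen generously here; the analysis only requires k = O((theta/m)^2).
out_far = smp_ltf_protocol(weights, 25, 5, 5, x, y_far, 20000, rng)
out_near = smp_ltf_protocol(weights, 25, 5, 5, x, y_near, 20000, rng)
```

Note that the referee only ever sees the $2k$ bits $a_j$, $b_j$; the strings $r_j$ are used solely
by Alice and Bob, which is what makes this a simultaneous message passing protocol.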

We now prove that there is a choice of $\{p_i\}$ such that this protocol succeeds with constant
probability. We will need the following lemma.

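Lemma 18. Let $x$ be an arbitrary $n$-bit string, and let $r$ be a randomly generated $n$-bit
string such that $\Pr[r_i = 1] = p_i$ for some $\{p_i\}$. Then

$$\Pr_r[\langle r, x \rangle = 1] = \tfrac{1}{2}\Bigl(1 - \prod_{i=1}^{n} (1 - 2 p_i x_i)\Bigr).$$

Proof. For $1 \le k \le n$, define $Q_k = \Pr_r[\bigoplus_{i=1}^{k} r_i x_i = 1]$. Then, for $2 \le k \le n$,

$$\begin{aligned}
Q_k &= \Bigl(1 - \Pr\Bigl[\bigoplus_{i=1}^{k-1} r_i x_i = 1\Bigr]\Bigr) \Pr[r_k x_k = 1] + \Pr\Bigl[\bigoplus_{i=1}^{k-1} r_i x_i = 1\Bigr] \bigl(1 - \Pr[r_k x_k = 1]\bigr) \\
    &= Q_{k-1}(1 - 2 p_k x_k) + p_k x_k,
\end{aligned}$$

and also $Q_n = \Pr_r[\langle r, x \rangle = 1]$. Now the lemma follows by induction on $k$, noting the
base case

$$Q_1 = p_1 x_1 = \tfrac{1}{2}\Bigl(1 - \prod_{i=1}^{1} (1 - 2 p_i x_i)\Bigr).$$

□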
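Lemma 18 can be confirmed exactly on small instances by summing over all $2^n$ strings $r$.
The Python sketch below is an illustrative check with hypothetical function names; it uses
exact rational arithmetic so that the two sides can be compared for equality.

```python
from fractions import Fraction
from itertools import product

def parity_prob_exact(p, x):
    """Pr[<r, x> = 1 over F_2], summing over every r with Pr[r_i = 1] = p[i]."""
    total = Fraction(0)
    for r in product((0, 1), repeat=len(x)):
        weight = Fraction(1)
        for ri, pi in zip(r, p):
            weight *= pi if ri else 1 - pi
        if sum(ri & xi for ri, xi in zip(r, x)) % 2 == 1:
            total += weight
    return total

def parity_prob_formula(p, x):
    """The closed form (1 - prod_i (1 - 2 p_i x_i)) / 2 from Lemma 18."""
    prod = Fraction(1)
    for pi, xi in zip(p, x):
        prod *= 1 - 2 * pi * xi
    return (1 - prod) / 2

# Arbitrary illustrative probabilities; any values in [0, 1] would do.
p = [Fraction(1, 3), Fraction(1, 5), Fraction(2, 7), Fraction(1, 2)]
all_match = all(
    parity_prob_exact(p, x) == parity_prob_formula(p, x)
    for x in product((0, 1), repeat=len(p))
)
```

Bits with $x_i = 0$ contribute a factor of 1 to the product, so only the positions where
$x_i = 1$ affect the probability.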



Now the central idea behind our approach is as follows. Consider the string $z = x \oplus y$.
The referee needs to output 1 if $\sum_{i=1}^{n} w_i z_i \ge \theta$. He does not know $\sum_{i=1}^{n} w_i z_i$, but if we pick
$p_i$ to be small and proportional to $w_i$, the quantity

$$\prod_{i=1}^{n} (1 - 2 p_i z_i),$$

which the referee can estimate using Lemma 18, should give an estimate of $\sum_{i=1}^{n} w_i z_i$, as
the first order terms are proportional to this sum. We will not in fact quite do this, but
will do something easier to analyse. If we pick

$$p_i = \tfrac{1}{2}\left(1 - (1 - 2\alpha)^{w_i}\right),$$

for some constant $0 < \alpha < 1$ to be determined, we get

$$\Pr_r[\langle r, z \rangle = 1] = \tfrac{1}{2}\Bigl(1 - \prod_{i=1}^{n} (1 - 2\alpha)^{w_i z_i}\Bigr) = \tfrac{1}{2}\left(1 - (1 - 2\alpha)^{\sum_{i=1}^{n} w_i z_i}\right). \qquad (2)$$

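Write $v = \sum_{i=1}^{n} w_i z_i$. Our task is now to choose a value for $\alpha$ that makes the two cases
$v < \theta$, $v \ge \theta$ easy to distinguish. As the most difficult cases to distinguish will be when
$v \approx \theta$, we achieve this by choosing $\alpha$ to maximise the absolute value of the derivative

$$\frac{d}{dv}\, \tfrac{1}{2}\left(1 - (1 - 2\alpha)^{v}\right) = -\tfrac{1}{2} (1 - 2\alpha)^{v} \ln(1 - 2\alpha),$$

evaluated at $v = \theta$. For $0 < \alpha < 1/2$ this derivative is positive, and we have

$$\frac{d}{d\alpha}\left(-\tfrac{1}{2} (1 - 2\alpha)^{\theta} \ln(1 - 2\alpha)\right) = (1 - 2\alpha)^{\theta - 1}\left(1 + \theta \ln(1 - 2\alpha)\right).$$

Setting this expression equal to 0 and solving for $\alpha$ gives

Inserting this value for $\alpha$ into eqn. (2), we obtain

$$\Pr_r[\langle r, z \rangle = 1] = \tfrac{1}{2}\left(1 - (1 - 1/\theta)^{\sum_{i=1}^{n} w_i z_i}\right).$$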
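The stationary point can be worked out explicitly. The following calculation is a sketch
under the notation above (the substitution $u = 1 - 2\alpha$ is introduced here for convenience
and is not part of the source). It suggests that the exact maximiser agrees, to first order
in $1/\theta$, with the base $1 - 1/\theta$ appearing in the probability above.

```latex
\begin{align*}
(1-2\alpha)^{\theta-1}\bigl(1 + \theta\ln(1-2\alpha)\bigr) = 0
  &\iff \ln u = -\frac{1}{\theta}, \qquad u = 1 - 2\alpha \in (0,1), \\
  &\iff u = e^{-1/\theta} = 1 - \frac{1}{\theta} + O\!\left(\frac{1}{\theta^{2}}\right), \\
  &\iff \alpha = \tfrac{1}{2}\bigl(1 - e^{-1/\theta}\bigr) = \frac{1}{2\theta} + O\!\left(\frac{1}{\theta^{2}}\right).
\end{align*}
```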

Our problem has therefore been reduced to determining whether $\sum_{i=1}^{n} w_i z_i \ge \theta$, using
samples from this distribution. The remainder of the proof is a standard Chernoff bound
argument. Let $X$ denote the sum of $k$ i.i.d. random variables $X_j$, which take values in $\{0, 1\}$,
with $\Pr[X_j = 1] = \mu$. Then the inequality

$$\Pr[|X - k\mu| \ge \delta] \le 2e^{-\delta^2/(2k)}$$

holds, implying that one can distinguish two different distributions with means $\mu$, $\mu'$, where
$|\mu - \mu'| \ge \epsilon$, with $O(1/\epsilon^2)$ samples from $X_j$.

Recall that $|\sum_{i=1}^{n} w_i z_i - \theta| \ge m$ for all $z$. Thus, for any $z$, $z'$ such that $f(z) \ne f(z')$,
we have

$$\begin{aligned}
\left|\Pr_r[\langle r, z \rangle = 1] - \Pr_r[\langle r, z' \rangle = 1]\right|
  &\ge \tfrac{1}{2}\left((1 - 1/\theta)^{\theta - m} - (1 - 1/\theta)^{\theta + m}\right) \\
  &= \tfrac{1}{2} (1 - 1/\theta)^{\theta}\left((1 - 1/\theta)^{-m} - (1 - 1/\theta)^{m}\right) \\
  &= \Omega(m/\theta),
\end{aligned}$$

which implies that it suffices for the referee to take $O((\theta/m)^2)$ samples from the distribution
to determine whether $\sum_{i=1}^{n} w_i z_i \ge \theta$ with constant probability. The threshold value picked
in the protocol is simply halfway between the two worst-case values of $z$.

5 Conclusions



We have presented a number of partial results on the communication complexity of XOR 
functions, but the initial question still remains: are the quantum and classical communi- 
cation complexities of XOR functions polynomially related? We believe that the class of 
XOR functions is of particular interest in the context of communication complexity because 
of the connection to Fourier analysis of boolean functions, and remain hopeful that this 
conjecture is tractable. The little-studied classical model of parity decision tree complexity 
also appears to be of some interest in its own right; the connection with the "width" of the 
Fourier spectrum is an interesting contrast to the usual decision tree complexity, which is 
polynomially related to the "height" (degree) of the Fourier spectrum. 

A final question: can the protocol of Section 4.2 be improved to use, for example,
$O((\theta/m) \log(\theta/m))$ communication, in a similar way to Huang et al.'s protocol for the
Hamming distance problem [8]?